Dataset statistics
| Number of variables | 36 |
|---|---|
| Number of observations | 1000000 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 5 |
| Duplicate rows (%) | < 0.1% |
| Total size in memory | 282.3 MiB |
| Average record size in memory | 296.0 B |
Variable types
| Numeric | 6 |
|---|---|
| Categorical | 30 |
| Dataset has 5 (< 0.1%) duplicate rows | Duplicates |
IN_TREINEIRO is highly overall correlated with TP_FAIXA_ETARIA and 1 other fields | High correlation |
Q002 is highly overall correlated with Q004 | High correlation |
Q003 is highly overall correlated with Q004 | High correlation |
Q004 is highly overall correlated with Q002 and 1 other fields | High correlation |
TP_ANO_CONCLUIU is highly overall correlated with TP_FAIXA_ETARIA | High correlation |
TP_ESCOLA is highly overall correlated with TP_ST_CONCLUSAO | High correlation |
TP_FAIXA_ETARIA is highly overall correlated with IN_TREINEIRO and 2 other fields | High correlation |
TP_ST_CONCLUSAO is highly overall correlated with IN_TREINEIRO and 2 other fields | High correlation |
TP_ESTADO_CIVIL is highly imbalanced (78.4%) | Imbalance |
TP_NACIONALIDADE is highly imbalanced (92.4%) | Imbalance |
Q007 is highly imbalanced (63.9%) | Imbalance |
Q009 is highly imbalanced (68.9%) | Imbalance |
Q012 is highly imbalanced (72.7%) | Imbalance |
Q015 is highly imbalanced (61.5%) | Imbalance |
Q017 is highly imbalanced (84.2%) | Imbalance |
Q022 is highly imbalanced (54.7%) | Imbalance |
Q025 is highly imbalanced (60.9%) | Imbalance |
TP_COR_RACA has 15812 (1.6%) zeros | Zeros |
TP_ANO_CONCLUIU has 620104 (62.0%) zeros | Zeros |
Reproduction
| Analysis started | 2024-04-15 03:28:05.601523 |
|---|---|
| Analysis finished | 2024-04-15 03:29:20.096321 |
| Duration | 1 minute and 14.49 seconds |
| Software version | ydata-profiling v0.0.dev0 |
| Download configuration | config.json |
TP_FAIXA_ETARIA
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.221059 |
| Minimum | 1 |
|---|---|
| Maximum | 20 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.3 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 12 |
| Maximum | 20 |
| Range | 19 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 3.3507723 |
|---|---|
| Coefficient of variation (CV) | 0.79382267 |
| Kurtosis | 2.2593399 |
| Mean | 4.221059 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.6851032 |
| Sum | 4221059 |
| Variance | 11.227675 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 249816 | |
| 2 | 247484 | |
| 4 | 113460 | |
| 1 | 110210 | |
| 5 | 63935 | 6.4% |
| 6 | 40491 | 4.0% |
| 11 | 37393 | 3.7% |
| 7 | 28496 | 2.8% |
| 8 | 21052 | 2.1% |
| 12 | 20199 | 2.0% |
| Other values (10) | 67464 | 6.7% |
| Value | Count | Frequency (%) |
| 1 | 110210 | |
| 2 | 247484 | |
| 3 | 249816 | |
| 4 | 113460 | |
| 5 | 63935 | 6.4% |
| 6 | 40491 | 4.0% |
| 7 | 28496 | 2.8% |
| 8 | 21052 | 2.1% |
| 9 | 15538 | 1.6% |
| 10 | 12728 | 1.3% |
| Value | Count | Frequency (%) |
| 20 | 138 | < 0.1% |
| 19 | 347 | < 0.1% |
| 18 | 876 | 0.1% |
| 17 | 2203 | 0.2% |
| 16 | 3987 | 0.4% |
| 15 | 6529 | 0.7% |
| 14 | 10418 | 1.0% |
| 13 | 14700 | 1.5% |
| 12 | 20199 | |
| 11 | 37393 |
TP_SEXO
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.3 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1000000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 612383 | |
| 1 | 387617 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 612383 | |
| 1 | 387617 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 612383 | |
| 1 | 387617 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1000000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 612383 | |
| 1 | 387617 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1000000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 612383 | |
| 1 | 387617 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1000000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 612383 | |
| 1 | 387617 |
TP_ESTADO_CIVIL
Categorical
IMBALANCE 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.3 MiB |
| 1 | |
|---|---|
| 2 | 33937 |
| 0 | 29197 |
| 3 | 11554 |
| 4 | 729 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1000000 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 924583 | |
| 2 | 33937 | 3.4% |
| 0 | 29197 | 2.9% |
| 3 | 11554 | 1.2% |
| 4 | 729 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 924583 | |
| 2 | 33937 | 3.4% |
| 0 | 29197 | 2.9% |
| 3 | 11554 | 1.2% |
| 4 | 729 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 924583 | |
| 2 | 33937 | 3.4% |
| 0 | 29197 | 2.9% |
| 3 | 11554 | 1.2% |
| 4 | 729 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1000000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 924583 | |
| 2 | 33937 | 3.4% |
| 0 | 29197 | 2.9% |
| 3 | 11554 | 1.2% |
| 4 | 729 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1000000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 924583 | |
| 2 | 33937 | 3.4% |
| 0 | 29197 | 2.9% |
| 3 | 11554 | 1.2% |
| 4 | 729 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1000000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 924583 | |
| 2 | 33937 | 3.4% |
| 0 | 29197 | 2.9% |
| 3 | 11554 | 1.2% |
| 4 | 729 | 0.1% |
TP_COR_RACA
Real number (ℝ)
ZEROS 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.977699 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 15812 |
| Zeros (%) | 1.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 3 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.0174285 |
|---|---|
| Coefficient of variation (CV) | 0.51445062 |
| Kurtosis | -1.3206045 |
| Mean | 1.977699 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.16764677 |
| Sum | 1977699 |
| Variance | 1.0351607 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 448821 | |
| 3 | 406770 | |
| 2 | 105255 | 10.5% |
| 4 | 18652 | 1.9% |
| 0 | 15812 | 1.6% |
| 5 | 4690 | 0.5% |
| Value | Count | Frequency (%) |
| 0 | 15812 | 1.6% |
| 1 | 448821 | |
| 2 | 105255 | 10.5% |
| 3 | 406770 | |
| 4 | 18652 | 1.9% |
| 5 | 4690 | 0.5% |
| Value | Count | Frequency (%) |
| 5 | 4690 | 0.5% |
| 4 | 18652 | 1.9% |
| 3 | 406770 | |
| 2 | 105255 | 10.5% |
| 1 | 448821 | |
| 0 | 15812 | 1.6% |
TP_NACIONALIDADE
Categorical
IMBALANCE 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.3 MiB |
| 1 | |
|---|---|
| 2 | 18778 |
| 4 | 2195 |
| 3 | 1509 |
| 0 | 290 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1000000 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 977228 | |
| 2 | 18778 | 1.9% |
| 4 | 2195 | 0.2% |
| 3 | 1509 | 0.2% |
| 0 | 290 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 977228 | |
| 2 | 18778 | 1.9% |
| 4 | 2195 | 0.2% |
| 3 | 1509 | 0.2% |
| 0 | 290 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 977228 | |
| 2 | 18778 | 1.9% |
| 4 | 2195 | 0.2% |
| 3 | 1509 | 0.2% |
| 0 | 290 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1000000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 977228 | |
| 2 | 18778 | 1.9% |
| 4 | 2195 | 0.2% |
| 3 | 1509 | 0.2% |
| 0 | 290 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1000000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 977228 | |
| 2 | 18778 | 1.9% |
| 4 | 2195 | 0.2% |
| 3 | 1509 | 0.2% |
| 0 | 290 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1000000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 977228 | |
| 2 | 18778 | 1.9% |
| 4 | 2195 | 0.2% |
| 3 | 1509 | 0.2% |
| 0 | 290 | < 0.1% |
TP_ST_CONCLUSAO
Categorical
HIGH CORRELATION 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.3 MiB |
| 1 | |
|---|---|
| 2 | |
| 3 | |
| 4 | 3036 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1000000 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 412561 | |
| 2 | 400724 | |
| 3 | 183679 | |
| 4 | 3036 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 412561 | |
| 2 | 400724 | |
| 3 | 183679 | |
| 4 | 3036 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 412561 | |
| 2 | 400724 | |
| 3 | 183679 | |
| 4 | 3036 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1000000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 412561 | |
| 2 | 400724 | |
| 3 | 183679 | |
| 4 | 3036 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1000000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 412561 | |
| 2 | 400724 | |
| 3 | 183679 | |
| 4 | 3036 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1000000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 412561 | |
| 2 | 400724 | |
| 3 | 183679 | |
| 4 | 3036 | 0.3% |
TP_ANO_CONCLUIU
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.750586 |
| Minimum | 0 |
|---|---|
| Maximum | 16 |
| Zeros | 620104 |
| Zeros (%) | 62.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 2 |
| 95-th percentile | 11 |
| Maximum | 16 |
| Range | 16 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 3.6401167 |
|---|---|
| Coefficient of variation (CV) | 2.0793704 |
| Kurtosis | 6.8120528 |
| Mean | 1.750586 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.6983524 |
| Sum | 1750586 |
| Variance | 13.25045 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 620104 | |
| 1 | 127661 | 12.8% |
| 2 | 56276 | 5.6% |
| 3 | 42488 | 4.2% |
| 16 | 30840 | 3.1% |
| 4 | 28319 | 2.8% |
| 5 | 21150 | 2.1% |
| 6 | 15082 | 1.5% |
| 7 | 11896 | 1.2% |
| 8 | 9659 | 1.0% |
| Other values (7) | 36525 | 3.7% |
| Value | Count | Frequency (%) |
| 0 | 620104 | |
| 1 | 127661 | 12.8% |
| 2 | 56276 | 5.6% |
| 3 | 42488 | 4.2% |
| 4 | 28319 | 2.8% |
| 5 | 21150 | 2.1% |
| 6 | 15082 | 1.5% |
| 7 | 11896 | 1.2% |
| 8 | 9659 | 1.0% |
| 9 | 7648 | 0.8% |
| Value | Count | Frequency (%) |
| 16 | 30840 | |
| 15 | 3540 | 0.4% |
| 14 | 3843 | 0.4% |
| 13 | 4502 | 0.5% |
| 12 | 4942 | 0.5% |
| 11 | 5343 | 0.5% |
| 10 | 6707 | 0.7% |
| 9 | 7648 | 0.8% |
| 8 | 9659 | 1.0% |
| 7 | 11896 | 1.2% |
TP_ESCOLA
Categorical
HIGH CORRELATION 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.3 MiB |
| 1 | |
|---|---|
| 2 | |
| 3 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1000000 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 599276 | |
| 2 | 312405 | |
| 3 | 88319 | 8.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 599276 | |
| 2 | 312405 | |
| 3 | 88319 | 8.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 599276 | |
| 2 | 312405 | |
| 3 | 88319 | 8.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1000000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 599276 | |
| 2 | 312405 | |
| 3 | 88319 | 8.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1000000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 599276 | |
| 2 | 312405 | |
| 3 | 88319 | 8.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1000000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 599276 | |
| 2 | 312405 | |
| 3 | 88319 | 8.8% |
IN_TREINEIRO
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.3 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1000000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 816321 | |
| 1 | 183679 | 18.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 816321 | |
| 1 | 183679 | 18.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 816321 | |
| 1 | 183679 | 18.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1000000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 816321 | |
| 1 | 183679 | 18.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1000000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 816321 | |
| 1 | 183679 | 18.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1000000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 816321 | |
| 1 | 183679 | 18.4% |
TP_LINGUA
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.3 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1000000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 586152 | |
| 1 | 413848 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 586152 | |
| 1 | 413848 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 586152 | |
| 1 | 413848 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1000000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 586152 | |
| 1 | 413848 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1000000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 586152 | |
| 1 | 413848 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1000000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 586152 | |
| 1 | 413848 |
Q001
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.3 MiB |
| 3 | |
|---|---|
| 2 | |
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1000000 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 3 |
| 3rd row | 3 |
| 4th row | 2 |
| 5th row | 3 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 544956 | |
| 2 | 256613 | |
| 1 | 198431 | 19.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 3 | 544956 | |
| 2 | 256613 | |
| 1 | 198431 | 19.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 544956 | |
| 2 | 256613 | |
| 1 | 198431 | 19.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1000000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 544956 | |
| 2 | 256613 | |
| 1 | 198431 | 19.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1000000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 544956 | |
| 2 | 256613 | |
| 1 | 198431 | 19.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1000000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 544956 | |
| 2 | 256613 | |
| 1 | 198431 | 19.8% |
Q002
Categorical
HIGH CORRELATION 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.3 MiB |
| 2 | |
|---|---|
| 3 | |
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1000000 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 3 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 582136 | |
| 3 | 293473 | |
| 1 | 124391 | 12.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 582136 | |
| 3 | 293473 | |
| 1 | 124391 | 12.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 582136 | |
| 3 | 293473 | |
| 1 | 124391 | 12.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1000000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 582136 | |
| 3 | 293473 | |
| 1 | 124391 | 12.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1000000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 582136 | |
| 3 | 293473 | |
| 1 | 124391 | 12.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1000000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 582136 | |
| 3 | 293473 | |
| 1 | 124391 | 12.4% |
Q003
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.110358 |
| Minimum | 1 |
|---|---|
| Maximum | 6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.3 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.4568477 |
|---|---|
| Coefficient of variation (CV) | 0.46838586 |
| Kurtosis | -0.7696353 |
| Mean | 3.110358 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.22127323 |
| Sum | 3110358 |
| Variance | 2.1224052 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 238861 | |
| 3 | 233517 | |
| 2 | 192304 | |
| 1 | 169816 | |
| 5 | 93073 | 9.3% |
| 6 | 72429 | 7.2% |
| Value | Count | Frequency (%) |
| 1 | 169816 | |
| 2 | 192304 | |
| 3 | 233517 | |
| 4 | 238861 | |
| 5 | 93073 | 9.3% |
| 6 | 72429 | 7.2% |
| Value | Count | Frequency (%) |
| 6 | 72429 | 7.2% |
| 5 | 93073 | 9.3% |
| 4 | 238861 | |
| 3 | 233517 | |
| 2 | 192304 | |
| 1 | 169816 |
Q004
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.999099 |
| Minimum | 1 |
|---|---|
| Maximum | 6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.3 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.4617005 |
|---|---|
| Coefficient of variation (CV) | 0.48737987 |
| Kurtosis | -0.82931442 |
| Mean | 2.999099 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.44834455 |
| Sum | 2999099 |
| Variance | 2.1365683 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 379187 | |
| 4 | 285647 | |
| 1 | 132872 | 13.3% |
| 6 | 74493 | 7.4% |
| 5 | 67452 | 6.7% |
| 3 | 60349 | 6.0% |
| Value | Count | Frequency (%) |
| 1 | 132872 | 13.3% |
| 2 | 379187 | |
| 3 | 60349 | 6.0% |
| 4 | 285647 | |
| 5 | 67452 | 6.7% |
| 6 | 74493 | 7.4% |
| Value | Count | Frequency (%) |
| 6 | 74493 | 7.4% |
| 5 | 67452 | 6.7% |
| 4 | 285647 | |
| 3 | 60349 | 6.0% |
| 2 | 379187 | |
| 1 | 132872 | 13.3% |
Q005
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.3 MiB |
| 4 | |
|---|---|
| 3 | |
| 2 | |
| 1 | 19632 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1000000 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 4 |
| 3rd row | 2 |
| 4th row | 3 |
| 5th row | 4 |
Common Values
| Value | Count | Frequency (%) |
| 4 | 584479 | |
| 3 | 281161 | |
| 2 | 114728 | 11.5% |
| 1 | 19632 | 2.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 4 | 584479 | |
| 3 | 281161 | |
| 2 | 114728 | 11.5% |
| 1 | 19632 | 2.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 584479 | |
| 3 | 281161 | |
| 2 | 114728 | 11.5% |
| 1 | 19632 | 2.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1000000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 584479 | |
| 3 | 281161 | |
| 2 | 114728 | 11.5% |
| 1 | 19632 | 2.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1000000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 584479 | |
| 3 | 281161 | |
| 2 | 114728 | 11.5% |
| 1 | 19632 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1000000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 584479 | |
| 3 | 281161 | |
| 2 | 114728 | 11.5% |
| 1 | 19632 | 2.0% |
Q006
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.3 MiB |
| 2 | |
|---|---|
| 3 | |
| 4 | |
| 5 | 59012 |
| 1 | 47194 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1000000 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 613086 | |
| 3 | 209566 | 21.0% |
| 4 | 71142 | 7.1% |
| 5 | 59012 | 5.9% |
| 1 | 47194 | 4.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 613086 | |
| 3 | 209566 | 21.0% |
| 4 | 71142 | 7.1% |
| 5 | 59012 | 5.9% |
| 1 | 47194 | 4.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 613086 | |
| 3 | 209566 | 21.0% |
| 4 | 71142 | 7.1% |
| 5 | 59012 | 5.9% |
| 1 | 47194 | 4.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1000000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 613086 | |
| 3 | 209566 | 21.0% |
| 4 | 71142 | 7.1% |
| 5 | 59012 | 5.9% |
| 1 | 47194 | 4.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1000000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 613086 | |
| 3 | 209566 | 21.0% |
| 4 | 71142 | 7.1% |
| 5 | 59012 | 5.9% |
| 1 | 47194 | 4.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1000000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 613086 | |
| 3 | 209566 | 21.0% |
| 4 | 71142 | 7.1% |
| 5 | 59012 | 5.9% |
| 1 | 47194 | 4.7% |
Q007
Categorical
IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.3 MiB |
| 1 | |
|---|---|
| 2 | 54734 |
| 3 | 46171 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1000000 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 899095 | |
| 2 | 54734 | 5.5% |
| 3 | 46171 | 4.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 899095 | |
| 2 | 54734 | 5.5% |
| 3 | 46171 | 4.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 899095 | |
| 2 | 54734 | 5.5% |
| 3 | 46171 | 4.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1000000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 899095 | |
| 2 | 54734 | 5.5% |
| 3 | 46171 | 4.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1000000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 899095 | |
| 2 | 54734 | 5.5% |
| 3 | 46171 | 4.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1000000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 899095 | |
| 2 | 54734 | 5.5% |
| 3 | 46171 | 4.6% |
Q008
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.3 MiB |
| 2 | |
|---|---|
| 3 | |
| 1 | 5824 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1000000 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 3 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 591320 | |
| 3 | 402856 | |
| 1 | 5824 | 0.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 591320 | |
| 3 | 402856 | |
| 1 | 5824 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 591320 | |
| 3 | 402856 | |
| 1 | 5824 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1000000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 591320 | |
| 3 | 402856 | |
| 1 | 5824 | 0.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1000000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 591320 | |
| 3 | 402856 | |
| 1 | 5824 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1000000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 591320 | |
| 3 | 402856 | |
| 1 | 5824 | 0.6% |
Q009
Categorical
IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.3 MiB |
| 3 | |
|---|---|
| 2 | |
| 1 | 5340 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1000000 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 3 |
| 3rd row | 3 |
| 4th row | 2 |
| 5th row | 3 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 902075 | |
| 2 | 92585 | 9.3% |
| 1 | 5340 | 0.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 3 | 902075 | |
| 2 | 92585 | 9.3% |
| 1 | 5340 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 902075 | |
| 2 | 92585 | 9.3% |
| 1 | 5340 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1000000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 902075 | |
| 2 | 92585 | 9.3% |
| 1 | 5340 | 0.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1000000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 902075 | |
| 2 | 92585 | 9.3% |
| 1 | 5340 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1000000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 902075 | |
| 2 | 92585 | 9.3% |
| 1 | 5340 | 0.5% |
Q010
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.3 MiB |
| 1 | |
|---|---|
| 2 | |
| 3 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1000000 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 448824 | |
| 2 | 422128 | |
| 3 | 129048 | 12.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 448824 | |
| 2 | 422128 | |
| 3 | 129048 | 12.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 448824 | |
| 2 | 422128 | |
| 3 | 129048 | 12.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1000000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 448824 | |
| 2 | 422128 | |
| 3 | 129048 | 12.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1000000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 448824 | |
| 2 | 422128 | |
| 3 | 129048 | 12.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1000000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 448824 | |
| 2 | 422128 | |
| 3 | 129048 | 12.9% |
Q011
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.3 MiB |
| 1 | |
|---|---|
| 2 | |
| 3 | 29556 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1000000 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 2 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 743468 | |
| 2 | 226976 | 22.7% |
| 3 | 29556 | 3.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 743468 | |
| 2 | 226976 | 22.7% |
| 3 | 29556 | 3.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 743468 | |
| 2 | 226976 | 22.7% |
| 3 | 29556 | 3.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1000000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 743468 | |
| 2 | 226976 | 22.7% |
| 3 | 29556 | 3.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1000000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 743468 | |
| 2 | 226976 | 22.7% |
| 3 | 29556 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1000000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 743468 | |
| 2 | 226976 | 22.7% |
| 3 | 29556 | 3.0% |
Q012
Categorical
IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.3 MiB |
| 2 | |
|---|---|
| 3 | 64210 |
| 1 | 11500 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1000000 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 924290 | |
| 3 | 64210 | 6.4% |
| 1 | 11500 | 1.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 924290 | |
| 3 | 64210 | 6.4% |
| 1 | 11500 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 924290 | |
| 3 | 64210 | 6.4% |
| 1 | 11500 | 1.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1000000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 924290 | |
| 3 | 64210 | 6.4% |
| 1 | 11500 | 1.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1000000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 924290 | |
| 3 | 64210 | 6.4% |
| 1 | 11500 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1000000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 924290 | |
| 3 | 64210 | 6.4% |
| 1 | 11500 | 1.1% |
Q013
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.3 MiB |
| 1 | |
|---|---|
| 2 | |
| 3 | 45011 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1000000 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 494365 | |
| 2 | 460624 | |
| 3 | 45011 | 4.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 494365 | |
| 2 | 460624 | |
| 3 | 45011 | 4.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 494365 | |
| 2 | 460624 | |
| 3 | 45011 | 4.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1000000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 494365 | |
| 2 | 460624 | |
| 3 | 45011 | 4.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1000000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 494365 | |
| 2 | 460624 | |
| 3 | 45011 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1000000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 494365 | |
| 2 | 460624 | |
| 3 | 45011 | 4.5% |
Q014
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.3 MiB |
| 2 | |
|---|---|
| 1 | |
| 3 | 13465 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1000000 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 661972 | |
| 1 | 324563 | |
| 3 | 13465 | 1.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 661972 | |
| 1 | 324563 | |
| 3 | 13465 | 1.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 661972 | |
| 1 | 324563 | |
| 3 | 13465 | 1.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1000000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 661972 | |
| 1 | 324563 | |
| 3 | 13465 | 1.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1000000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 661972 | |
| 1 | 324563 | |
| 3 | 13465 | 1.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1000000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 661972 | |
| 1 | 324563 | |
| 3 | 13465 | 1.3% |
Q015
Categorical
IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.3 MiB |
| 1 | |
|---|---|
| 2 | |
| 3 | 1886 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1000000 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 855613 | |
| 2 | 142501 | 14.3% |
| 3 | 1886 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 855613 | |
| 2 | 142501 | 14.3% |
| 3 | 1886 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 855613 | |
| 2 | 142501 | 14.3% |
| 3 | 1886 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1000000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 855613 | |
| 2 | 142501 | 14.3% |
| 3 | 1886 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1000000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 855613 | |
| 2 | 142501 | 14.3% |
| 3 | 1886 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1000000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 855613 | |
| 2 | 142501 | 14.3% |
| 3 | 1886 | 0.2% |
Q016
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.3 MiB |
| 2 | |
|---|---|
| 1 | |
| 3 | 8822 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1000000 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 2 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 538141 | |
| 1 | 453037 | |
| 3 | 8822 | 0.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 538141 | |
| 1 | 453037 | |
| 3 | 8822 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 538141 | |
| 1 | 453037 | |
| 3 | 8822 | 0.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1000000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 538141 | |
| 1 | 453037 | |
| 3 | 8822 | 0.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1000000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 538141 | |
| 1 | 453037 | |
| 3 | 8822 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1000000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 538141 | |
| 1 | 453037 | |
| 3 | 8822 | 0.9% |
Q017
Categorical
IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.3 MiB |
| 1 | |
|---|---|
| 2 | 39748 |
| 3 | 729 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1000000 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 959523 | |
| 2 | 39748 | 4.0% |
| 3 | 729 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 959523 | |
| 2 | 39748 | 4.0% |
| 3 | 729 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 959523 | |
| 2 | 39748 | 4.0% |
| 3 | 729 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1000000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 959523 | |
| 2 | 39748 | 4.0% |
| 3 | 729 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1000000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 959523 | |
| 2 | 39748 | 4.0% |
| 3 | 729 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1000000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 959523 | |
| 2 | 39748 | 4.0% |
| 3 | 729 | 0.1% |
Q018
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.3 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1000000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 710781 | |
| 1 | 289219 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 710781 | |
| 1 | 289219 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 710781 | |
| 1 | 289219 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1000000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 710781 | |
| 1 | 289219 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1000000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 710781 | |
| 1 | 289219 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1000000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 710781 | |
| 1 | 289219 |
Q019
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.3 MiB |
| 2 | |
|---|---|
| 3 | |
| 1 | 50746 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1000000 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 614539 | |
| 3 | 334715 | |
| 1 | 50746 | 5.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 614539 | |
| 3 | 334715 | |
| 1 | 50746 | 5.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 614539 | |
| 3 | 334715 | |
| 1 | 50746 | 5.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1000000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 614539 | |
| 3 | 334715 | |
| 1 | 50746 | 5.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1000000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 614539 | |
| 3 | 334715 | |
| 1 | 50746 | 5.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1000000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 614539 | |
| 3 | 334715 | |
| 1 | 50746 | 5.1% |
Q020
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.3 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1000000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 811721 | |
| 1 | 188279 | 18.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 811721 | |
| 1 | 188279 | 18.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 811721 | |
| 1 | 188279 | 18.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1000000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 811721 | |
| 1 | 188279 | 18.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1000000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 811721 | |
| 1 | 188279 | 18.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1000000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 811721 | |
| 1 | 188279 | 18.8% |
Q021
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.3 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1000000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 742535 | |
| 1 | 257465 | 25.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 742535 | |
| 1 | 257465 | 25.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 742535 | |
| 1 | 257465 | 25.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1000000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 742535 | |
| 1 | 257465 | 25.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1000000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 742535 | |
| 1 | 257465 | 25.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1000000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 742535 | |
| 1 | 257465 | 25.7% |
Q022
Categorical
IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.3 MiB |
| 3 | |
|---|---|
| 2 | |
| 1 | 19570 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1000000 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 3 |
| 3rd row | 3 |
| 4th row | 3 |
| 5th row | 3 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 840786 | |
| 2 | 139644 | 14.0% |
| 1 | 19570 | 2.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 3 | 840786 | |
| 2 | 139644 | 14.0% |
| 1 | 19570 | 2.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 840786 | |
| 2 | 139644 | 14.0% |
| 1 | 19570 | 2.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1000000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 840786 | |
| 2 | 139644 | 14.0% |
| 1 | 19570 | 2.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1000000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 840786 | |
| 2 | 139644 | 14.0% |
| 1 | 19570 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1000000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 840786 | |
| 2 | 139644 | 14.0% |
| 1 | 19570 | 2.0% |
Q023
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.3 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1000000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 858584 | |
| 1 | 141416 | 14.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 858584 | |
| 1 | 141416 | 14.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 858584 | |
| 1 | 141416 | 14.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1000000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 858584 | |
| 1 | 141416 | 14.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1000000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 858584 | |
| 1 | 141416 | 14.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1000000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 858584 | |
| 1 | 141416 | 14.1% |
Q024
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.3 MiB |
| 1 | |
|---|---|
| 2 | |
| 3 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1000000 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 405000 | |
| 2 | 403871 | |
| 3 | 191129 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 405000 | |
| 2 | 403871 | |
| 3 | 191129 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 405000 | |
| 2 | 403871 | |
| 3 | 191129 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1000000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 405000 | |
| 2 | 403871 | |
| 3 | 191129 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1000000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 405000 | |
| 2 | 403871 | |
| 3 | 191129 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1000000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 405000 | |
| 2 | 403871 | |
| 3 | 191129 |
Q025
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.3 MiB |
| 1 | |
|---|---|
| 0 | 76899 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1000000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 923101 | |
| 0 | 76899 | 7.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 923101 | |
| 0 | 76899 | 7.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 923101 | |
| 0 | 76899 | 7.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1000000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 923101 | |
| 0 | 76899 | 7.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1000000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 923101 | |
| 0 | 76899 | 7.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1000000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 923101 | |
| 0 | 76899 | 7.7% |
MEDIAS
Real number (ℝ)
| Distinct | 45208 |
|---|---|
| Distinct (%) | 4.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 546.88478 |
| Minimum | 0 |
|---|---|
| Maximum | 855.98 |
| Zeros | 8 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 406.06 |
| Q1 | 487.66 |
| median | 544.08 |
| Q3 | 605.84 |
| 95-th percentile | 696.14 |
| Maximum | 855.98 |
| Range | 855.98 |
| Interquartile range (IQR) | 118.18 |
Descriptive statistics
| Standard deviation | 88.021571 |
|---|---|
| Coefficient of variation (CV) | 0.16095085 |
| Kurtosis | -0.032176653 |
| Mean | 546.88478 |
| Median Absolute Deviation (MAD) | 58.9 |
| Skewness | 0.024465885 |
| Sum | 5.4688478 × 108 |
| Variance | 7747.7969 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 534.74 | 120 | < 0.1% |
| 542.7 | 118 | < 0.1% |
| 520.7 | 117 | < 0.1% |
| 562.8 | 117 | < 0.1% |
| 512.76 | 117 | < 0.1% |
| 518.28 | 116 | < 0.1% |
| 514.72 | 116 | < 0.1% |
| 533.34 | 116 | < 0.1% |
| 530.6 | 116 | < 0.1% |
| 552.3 | 115 | < 0.1% |
| Other values (45198) | 998832 |
| Value | Count | Frequency (%) |
| 0 | 8 | |
| 56.14 | 1 | < 0.1% |
| 66.1 | 1 | < 0.1% |
| 72.12 | 1 | < 0.1% |
| 89.12 | 1 | < 0.1% |
| 116 | 1 | < 0.1% |
| 127.12 | 1 | < 0.1% |
| 131.66 | 1 | < 0.1% |
| 136.24 | 1 | < 0.1% |
| 136.44 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 855.98 | 1 | |
| 855.82 | 1 | |
| 851.84 | 1 | |
| 849.86 | 1 | |
| 839.98 | 1 | |
| 839.54 | 1 | |
| 837.56 | 1 | |
| 837.06 | 1 | |
| 836.84 | 1 | |
| 836.76 | 1 |
| IN_TREINEIRO | MEDIAS | Q001 | Q002 | Q003 | Q004 | Q005 | Q006 | Q007 | Q008 | Q009 | Q010 | Q011 | Q012 | Q013 | Q014 | Q015 | Q016 | Q017 | Q018 | Q019 | Q020 | Q021 | Q022 | Q023 | Q024 | Q025 | TP_ANO_CONCLUIU | TP_COR_RACA | TP_ESCOLA | TP_ESTADO_CIVIL | TP_FAIXA_ETARIA | TP_LINGUA | TP_NACIONALIDADE | TP_SEXO | TP_ST_CONCLUSAO | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| IN_TREINEIRO | 1.000 | 0.006 | 0.140 | 0.170 | 0.108 | 0.121 | 0.084 | 0.169 | 0.116 | 0.143 | 0.077 | 0.155 | 0.027 | 0.068 | 0.085 | 0.082 | 0.046 | 0.093 | 0.062 | 0.105 | 0.116 | 0.068 | 0.115 | 0.068 | 0.043 | 0.124 | 0.052 | -0.358 | -0.069 | 0.388 | 0.087 | -0.591 | 0.111 | 0.013 | 0.036 | 1.000 |
| MEDIAS | 0.006 | 1.000 | 0.220 | 0.248 | 0.296 | 0.299 | 0.024 | 0.225 | 0.156 | 0.227 | 0.087 | 0.230 | 0.051 | 0.093 | 0.186 | 0.177 | 0.087 | 0.185 | 0.110 | 0.290 | 0.199 | 0.133 | 0.202 | 0.140 | 0.165 | 0.307 | 0.190 | 0.070 | -0.231 | 0.190 | 0.039 | -0.071 | 0.278 | 0.034 | 0.057 | 0.063 |
| Q001 | 0.140 | 0.220 | 1.000 | 0.402 | 0.446 | 0.391 | 0.052 | 0.294 | 0.127 | 0.236 | 0.116 | 0.244 | 0.047 | 0.102 | 0.175 | 0.228 | 0.096 | 0.206 | 0.094 | 0.298 | 0.215 | 0.118 | 0.243 | 0.162 | 0.161 | 0.268 | 0.209 | -0.153 | -0.189 | 0.148 | 0.104 | -0.260 | 0.236 | 0.023 | 0.058 | 0.134 |
| Q002 | 0.170 | 0.248 | 0.402 | 1.000 | 0.376 | 0.524 | 0.058 | 0.358 | 0.199 | 0.267 | 0.126 | 0.276 | 0.029 | 0.117 | 0.172 | 0.212 | 0.108 | 0.201 | 0.132 | 0.306 | 0.220 | 0.136 | 0.273 | 0.161 | 0.161 | 0.310 | 0.204 | -0.161 | -0.193 | 0.180 | 0.128 | -0.287 | 0.228 | 0.024 | 0.057 | 0.154 |
| Q003 | 0.108 | 0.296 | 0.446 | 0.376 | 1.000 | 0.537 | 0.044 | 0.340 | 0.284 | 0.320 | 0.116 | 0.337 | 0.105 | 0.163 | 0.225 | 0.279 | 0.147 | 0.264 | 0.196 | 0.389 | 0.287 | 0.192 | 0.349 | 0.182 | 0.220 | 0.368 | 0.247 | -0.113 | -0.193 | 0.219 | 0.044 | -0.184 | 0.275 | 0.021 | 0.063 | 0.105 |
| Q004 | 0.121 | 0.299 | 0.391 | 0.524 | 0.537 | 1.000 | 0.040 | 0.314 | 0.282 | 0.303 | 0.115 | 0.319 | 0.089 | 0.150 | 0.211 | 0.275 | 0.136 | 0.254 | 0.177 | 0.355 | 0.265 | 0.170 | 0.325 | 0.183 | 0.198 | 0.351 | 0.253 | -0.121 | -0.193 | 0.201 | 0.049 | -0.205 | 0.256 | 0.020 | 0.062 | 0.108 |
| Q005 | 0.084 | 0.024 | 0.052 | 0.058 | 0.044 | 0.040 | 1.000 | 0.078 | 0.050 | 0.095 | 0.216 | 0.142 | 0.076 | 0.073 | 0.066 | 0.065 | 0.028 | 0.056 | 0.026 | 0.073 | 0.135 | 0.059 | 0.076 | 0.235 | 0.061 | 0.077 | 0.055 | -0.135 | 0.037 | 0.086 | 0.078 | -0.147 | 0.045 | 0.007 | 0.029 | 0.106 |
| Q006 | 0.169 | 0.225 | 0.294 | 0.358 | 0.340 | 0.314 | 0.078 | 1.000 | 0.361 | 0.380 | 0.168 | 0.436 | 0.067 | 0.220 | 0.280 | 0.305 | 0.191 | 0.300 | 0.244 | 0.497 | 0.354 | 0.250 | 0.434 | 0.216 | 0.262 | 0.444 | 0.276 | -0.105 | -0.265 | 0.242 | 0.020 | -0.217 | 0.279 | 0.026 | 0.086 | 0.108 |
| Q007 | 0.116 | 0.156 | 0.127 | 0.199 | 0.284 | 0.282 | 0.050 | 0.361 | 1.000 | 0.211 | 0.053 | 0.262 | 0.039 | 0.172 | 0.178 | 0.129 | 0.128 | 0.157 | 0.198 | 0.253 | 0.201 | 0.165 | 0.283 | 0.055 | 0.152 | 0.238 | 0.067 | -0.096 | -0.141 | 0.160 | 0.022 | -0.144 | 0.144 | 0.015 | 0.036 | 0.093 |
| Q008 | 0.143 | 0.227 | 0.236 | 0.267 | 0.320 | 0.303 | 0.095 | 0.380 | 0.211 | 1.000 | 0.228 | 0.359 | 0.038 | 0.237 | 0.239 | 0.282 | 0.143 | 0.270 | 0.138 | 0.400 | 0.336 | 0.189 | 0.334 | 0.196 | 0.212 | 0.338 | 0.248 | -0.115 | -0.210 | 0.172 | 0.037 | -0.204 | 0.220 | 0.026 | 0.059 | 0.118 |
| Q009 | 0.077 | 0.087 | 0.116 | 0.126 | 0.116 | 0.115 | 0.216 | 0.168 | 0.053 | 0.228 | 1.000 | 0.195 | 0.069 | 0.155 | 0.119 | 0.155 | 0.064 | 0.137 | 0.039 | 0.162 | 0.189 | 0.094 | 0.141 | 0.245 | 0.084 | 0.147 | 0.240 | -0.111 | -0.087 | 0.062 | 0.063 | -0.165 | 0.094 | 0.009 | 0.019 | 0.087 |
| Q010 | 0.155 | 0.230 | 0.244 | 0.276 | 0.337 | 0.319 | 0.142 | 0.436 | 0.262 | 0.359 | 0.195 | 1.000 | 0.046 | 0.223 | 0.288 | 0.336 | 0.173 | 0.320 | 0.176 | 0.478 | 0.331 | 0.213 | 0.359 | 0.208 | 0.237 | 0.378 | 0.265 | -0.157 | -0.266 | 0.173 | 0.036 | -0.252 | 0.237 | 0.026 | 0.057 | 0.142 |
| Q011 | 0.027 | 0.051 | 0.047 | 0.029 | 0.105 | 0.089 | 0.076 | 0.067 | 0.039 | 0.038 | 0.069 | 0.046 | 1.000 | 0.039 | 0.039 | 0.048 | 0.029 | 0.039 | 0.033 | 0.064 | 0.049 | 0.033 | 0.050 | 0.057 | 0.070 | 0.054 | 0.052 | -0.041 | 0.057 | 0.055 | 0.017 | -0.047 | 0.075 | 0.011 | 0.031 | 0.032 |
| Q012 | 0.068 | 0.093 | 0.102 | 0.117 | 0.163 | 0.150 | 0.073 | 0.220 | 0.172 | 0.237 | 0.155 | 0.223 | 0.039 | 1.000 | 0.361 | 0.182 | 0.112 | 0.211 | 0.129 | 0.234 | 0.233 | 0.160 | 0.207 | 0.133 | 0.141 | 0.184 | 0.187 | -0.076 | -0.115 | 0.085 | 0.022 | -0.109 | 0.105 | 0.010 | 0.036 | 0.065 |
| Q013 | 0.085 | 0.186 | 0.175 | 0.172 | 0.225 | 0.211 | 0.066 | 0.280 | 0.178 | 0.239 | 0.119 | 0.288 | 0.039 | 0.361 | 1.000 | 0.284 | 0.202 | 0.283 | 0.138 | 0.365 | 0.257 | 0.213 | 0.290 | 0.168 | 0.185 | 0.271 | 0.210 | -0.126 | -0.206 | 0.112 | 0.061 | -0.183 | 0.202 | 0.018 | 0.030 | 0.097 |
| Q014 | 0.082 | 0.177 | 0.228 | 0.212 | 0.279 | 0.275 | 0.065 | 0.305 | 0.129 | 0.282 | 0.155 | 0.336 | 0.048 | 0.182 | 0.284 | 1.000 | 0.264 | 0.347 | 0.123 | 0.394 | 0.281 | 0.166 | 0.297 | 0.206 | 0.207 | 0.319 | 0.291 | -0.113 | -0.239 | 0.119 | 0.019 | -0.162 | 0.223 | 0.023 | 0.063 | 0.093 |
| Q015 | 0.046 | 0.087 | 0.096 | 0.108 | 0.147 | 0.136 | 0.028 | 0.191 | 0.128 | 0.143 | 0.064 | 0.173 | 0.029 | 0.112 | 0.202 | 0.264 | 1.000 | 0.163 | 0.192 | 0.253 | 0.150 | 0.145 | 0.203 | 0.076 | 0.100 | 0.173 | 0.097 | -0.076 | -0.114 | 0.075 | 0.023 | -0.098 | 0.100 | 0.011 | 0.027 | 0.057 |
| Q016 | 0.093 | 0.185 | 0.206 | 0.201 | 0.264 | 0.254 | 0.056 | 0.300 | 0.157 | 0.270 | 0.137 | 0.320 | 0.039 | 0.211 | 0.283 | 0.347 | 0.163 | 1.000 | 0.154 | 0.413 | 0.299 | 0.192 | 0.302 | 0.178 | 0.213 | 0.309 | 0.263 | -0.107 | -0.245 | 0.130 | 0.024 | -0.167 | 0.224 | 0.023 | 0.057 | 0.094 |
| Q017 | 0.062 | 0.110 | 0.094 | 0.132 | 0.196 | 0.177 | 0.026 | 0.244 | 0.198 | 0.138 | 0.039 | 0.176 | 0.033 | 0.129 | 0.138 | 0.123 | 0.192 | 0.154 | 1.000 | 0.236 | 0.131 | 0.147 | 0.186 | 0.037 | 0.126 | 0.179 | 0.053 | -0.057 | -0.118 | 0.109 | 0.011 | -0.086 | 0.109 | 0.015 | 0.040 | 0.052 |
| Q018 | 0.105 | 0.290 | 0.298 | 0.306 | 0.389 | 0.355 | 0.073 | 0.497 | 0.253 | 0.400 | 0.162 | 0.478 | 0.064 | 0.234 | 0.365 | 0.394 | 0.253 | 0.413 | 0.236 | 1.000 | 0.432 | 0.255 | 0.339 | 0.195 | 0.240 | 0.483 | 0.173 | -0.113 | -0.264 | 0.220 | 0.033 | -0.175 | 0.240 | 0.036 | 0.052 | 0.142 |
| Q019 | 0.116 | 0.199 | 0.215 | 0.220 | 0.287 | 0.265 | 0.135 | 0.354 | 0.201 | 0.336 | 0.189 | 0.331 | 0.049 | 0.233 | 0.257 | 0.281 | 0.150 | 0.299 | 0.131 | 0.432 | 1.000 | 0.253 | 0.386 | 0.210 | 0.245 | 0.333 | 0.234 | -0.122 | -0.214 | 0.167 | 0.046 | -0.190 | 0.226 | 0.028 | 0.079 | 0.108 |
| Q020 | 0.068 | 0.133 | 0.118 | 0.136 | 0.192 | 0.170 | 0.059 | 0.250 | 0.165 | 0.189 | 0.094 | 0.213 | 0.033 | 0.160 | 0.213 | 0.166 | 0.145 | 0.192 | 0.147 | 0.255 | 0.253 | 1.000 | 0.216 | 0.118 | 0.178 | 0.255 | 0.080 | -0.080 | -0.104 | 0.113 | 0.034 | -0.109 | 0.114 | 0.014 | 0.018 | 0.095 |
| Q021 | 0.115 | 0.202 | 0.243 | 0.273 | 0.349 | 0.325 | 0.076 | 0.434 | 0.283 | 0.334 | 0.141 | 0.359 | 0.050 | 0.207 | 0.290 | 0.297 | 0.203 | 0.302 | 0.186 | 0.339 | 0.386 | 0.216 | 1.000 | 0.168 | 0.231 | 0.375 | 0.151 | -0.124 | -0.166 | 0.207 | 0.042 | -0.177 | 0.172 | 0.022 | 0.022 | 0.150 |
| Q022 | 0.068 | 0.140 | 0.162 | 0.161 | 0.182 | 0.183 | 0.235 | 0.216 | 0.055 | 0.196 | 0.245 | 0.208 | 0.057 | 0.133 | 0.168 | 0.206 | 0.076 | 0.178 | 0.037 | 0.195 | 0.210 | 0.118 | 0.168 | 1.000 | 0.088 | 0.199 | 0.333 | -0.083 | -0.114 | 0.067 | 0.046 | -0.151 | 0.136 | 0.014 | 0.020 | 0.075 |
| Q023 | 0.043 | 0.165 | 0.161 | 0.161 | 0.220 | 0.198 | 0.061 | 0.262 | 0.152 | 0.212 | 0.084 | 0.237 | 0.070 | 0.141 | 0.185 | 0.207 | 0.100 | 0.213 | 0.126 | 0.240 | 0.245 | 0.178 | 0.231 | 0.088 | 1.000 | 0.262 | 0.104 | -0.033 | -0.124 | 0.133 | 0.025 | -0.066 | 0.135 | 0.021 | 0.032 | 0.051 |
| Q024 | 0.124 | 0.307 | 0.268 | 0.310 | 0.368 | 0.351 | 0.077 | 0.444 | 0.238 | 0.338 | 0.147 | 0.378 | 0.054 | 0.184 | 0.271 | 0.319 | 0.173 | 0.309 | 0.179 | 0.483 | 0.333 | 0.255 | 0.375 | 0.199 | 0.262 | 1.000 | 0.304 | -0.037 | -0.261 | 0.199 | 0.023 | -0.137 | 0.280 | 0.035 | 0.095 | 0.094 |
| Q025 | 0.052 | 0.190 | 0.209 | 0.204 | 0.247 | 0.253 | 0.055 | 0.276 | 0.067 | 0.248 | 0.240 | 0.265 | 0.052 | 0.187 | 0.210 | 0.291 | 0.097 | 0.263 | 0.053 | 0.173 | 0.234 | 0.080 | 0.151 | 0.333 | 0.104 | 0.304 | 1.000 | -0.046 | -0.128 | 0.084 | 0.018 | -0.097 | 0.131 | 0.017 | 0.033 | 0.064 |
| TP_ANO_CONCLUIU | -0.358 | 0.070 | -0.153 | -0.161 | -0.113 | -0.121 | -0.135 | -0.105 | -0.096 | -0.115 | -0.111 | -0.157 | -0.041 | -0.076 | -0.126 | -0.113 | -0.076 | -0.107 | -0.057 | -0.113 | -0.122 | -0.080 | -0.124 | -0.083 | -0.033 | -0.037 | -0.046 | 1.000 | 0.068 | 0.336 | 0.232 | 0.759 | 0.123 | 0.011 | 0.019 | 0.400 |
| TP_COR_RACA | -0.069 | -0.231 | -0.189 | -0.193 | -0.193 | -0.193 | 0.037 | -0.265 | -0.141 | -0.210 | -0.087 | -0.266 | 0.057 | -0.115 | -0.206 | -0.239 | -0.114 | -0.245 | -0.118 | -0.264 | -0.214 | -0.104 | -0.166 | -0.114 | -0.124 | -0.261 | -0.128 | 0.068 | 1.000 | 0.111 | 0.042 | 0.116 | 0.182 | 0.036 | 0.018 | 0.068 |
| TP_ESCOLA | 0.388 | 0.190 | 0.148 | 0.180 | 0.219 | 0.201 | 0.086 | 0.242 | 0.160 | 0.172 | 0.062 | 0.173 | 0.055 | 0.085 | 0.112 | 0.119 | 0.075 | 0.130 | 0.109 | 0.220 | 0.167 | 0.113 | 0.207 | 0.067 | 0.133 | 0.199 | 0.084 | 0.336 | 0.111 | 1.000 | 0.096 | -0.313 | 0.135 | 0.020 | 0.049 | 0.707 |
| TP_ESTADO_CIVIL | 0.087 | 0.039 | 0.104 | 0.128 | 0.044 | 0.049 | 0.078 | 0.020 | 0.022 | 0.037 | 0.063 | 0.036 | 0.017 | 0.022 | 0.061 | 0.019 | 0.023 | 0.024 | 0.011 | 0.033 | 0.046 | 0.034 | 0.042 | 0.046 | 0.025 | 0.023 | 0.018 | 0.232 | 0.042 | 0.096 | 1.000 | 0.195 | 0.088 | 0.011 | 0.020 | 0.117 |
| TP_FAIXA_ETARIA | -0.591 | -0.071 | -0.260 | -0.287 | -0.184 | -0.205 | -0.147 | -0.217 | -0.144 | -0.204 | -0.165 | -0.252 | -0.047 | -0.109 | -0.183 | -0.162 | -0.098 | -0.167 | -0.086 | -0.175 | -0.190 | -0.109 | -0.177 | -0.151 | -0.066 | -0.137 | -0.097 | 0.759 | 0.116 | -0.313 | 0.195 | 1.000 | 0.182 | 0.011 | 0.033 | 0.500 |
| TP_LINGUA | 0.111 | 0.278 | 0.236 | 0.228 | 0.275 | 0.256 | 0.045 | 0.279 | 0.144 | 0.220 | 0.094 | 0.237 | 0.075 | 0.105 | 0.202 | 0.223 | 0.100 | 0.224 | 0.109 | 0.240 | 0.226 | 0.114 | 0.172 | 0.136 | 0.135 | 0.280 | 0.131 | 0.123 | 0.182 | 0.135 | 0.088 | 0.182 | 1.000 | 0.034 | 0.096 | 0.140 |
| TP_NACIONALIDADE | 0.013 | 0.034 | 0.023 | 0.024 | 0.021 | 0.020 | 0.007 | 0.026 | 0.015 | 0.026 | 0.009 | 0.026 | 0.011 | 0.010 | 0.018 | 0.023 | 0.011 | 0.023 | 0.015 | 0.036 | 0.028 | 0.014 | 0.022 | 0.014 | 0.021 | 0.035 | 0.017 | 0.011 | 0.036 | 0.020 | 0.011 | 0.011 | 0.034 | 1.000 | 0.028 | 0.011 |
| TP_SEXO | 0.036 | 0.057 | 0.058 | 0.057 | 0.063 | 0.062 | 0.029 | 0.086 | 0.036 | 0.059 | 0.019 | 0.057 | 0.031 | 0.036 | 0.030 | 0.063 | 0.027 | 0.057 | 0.040 | 0.052 | 0.079 | 0.018 | 0.022 | 0.020 | 0.032 | 0.095 | 0.033 | 0.019 | 0.018 | 0.049 | 0.020 | 0.033 | 0.096 | 0.028 | 1.000 | 0.041 |
| TP_ST_CONCLUSAO | 1.000 | 0.063 | 0.134 | 0.154 | 0.105 | 0.108 | 0.106 | 0.108 | 0.093 | 0.118 | 0.087 | 0.142 | 0.032 | 0.065 | 0.097 | 0.093 | 0.057 | 0.094 | 0.052 | 0.142 | 0.108 | 0.095 | 0.150 | 0.075 | 0.051 | 0.094 | 0.064 | 0.400 | 0.068 | 0.707 | 0.117 | 0.500 | 0.140 | 0.011 | 0.041 | 1.000 |
| TP_FAIXA_ETARIA | TP_SEXO | TP_ESTADO_CIVIL | TP_COR_RACA | TP_NACIONALIDADE | TP_ST_CONCLUSAO | TP_ANO_CONCLUIU | TP_ESCOLA | IN_TREINEIRO | TP_LINGUA | Q001 | Q002 | Q003 | Q004 | Q005 | Q006 | Q007 | Q008 | Q009 | Q010 | Q011 | Q012 | Q013 | Q014 | Q015 | Q016 | Q017 | Q018 | Q019 | Q020 | Q021 | Q022 | Q023 | Q024 | Q025 | MEDIAS | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 440156 | 1 | 0 | 1 | 1 | 1 | 3 | 0 | 1 | 1 | 0 | 2 | 2 | 2 | 2 | 3 | 2 | 1 | 2 | 2 | 1 | 1 | 2 | 2 | 2 | 1 | 1 | 1 | 0 | 1 | 0 | 0 | 3 | 0 | 2 | 1 | 484.78 |
| 707341 | 3 | 0 | 1 | 1 | 1 | 2 | 0 | 2 | 0 | 1 | 3 | 2 | 3 | 3 | 4 | 2 | 1 | 3 | 3 | 2 | 2 | 2 | 1 | 2 | 1 | 2 | 1 | 0 | 2 | 1 | 0 | 3 | 0 | 2 | 1 | 540.66 |
| 1890223 | 2 | 0 | 1 | 3 | 1 | 2 | 0 | 2 | 0 | 1 | 3 | 3 | 4 | 4 | 2 | 2 | 1 | 2 | 3 | 2 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 0 | 2 | 0 | 0 | 3 | 0 | 1 | 1 | 522.98 |
| 1290733 | 3 | 0 | 1 | 3 | 1 | 2 | 0 | 2 | 0 | 1 | 2 | 2 | 3 | 1 | 3 | 2 | 1 | 2 | 2 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 0 | 2 | 0 | 0 | 3 | 0 | 1 | 0 | 549.00 |
| 115016 | 2 | 0 | 1 | 1 | 1 | 2 | 0 | 2 | 0 | 0 | 3 | 2 | 3 | 2 | 4 | 2 | 1 | 2 | 3 | 1 | 2 | 2 | 2 | 2 | 1 | 1 | 1 | 0 | 2 | 0 | 0 | 3 | 0 | 2 | 1 | 618.80 |
| 1533669 | 3 | 0 | 1 | 3 | 1 | 2 | 0 | 2 | 0 | 0 | 1 | 2 | 3 | 2 | 3 | 2 | 1 | 3 | 3 | 2 | 1 | 2 | 2 | 2 | 1 | 1 | 1 | 0 | 2 | 0 | 0 | 3 | 0 | 1 | 1 | 606.86 |
| 1795026 | 6 | 1 | 1 | 0 | 1 | 1 | 5 | 1 | 0 | 0 | 3 | 2 | 5 | 6 | 4 | 3 | 1 | 2 | 3 | 2 | 1 | 2 | 1 | 2 | 1 | 1 | 1 | 0 | 3 | 0 | 1 | 3 | 0 | 2 | 1 | 522.92 |
| 2270228 | 5 | 1 | 1 | 1 | 1 | 1 | 3 | 1 | 0 | 0 | 3 | 3 | 4 | 5 | 1 | 3 | 1 | 2 | 3 | 1 | 1 | 2 | 2 | 1 | 1 | 2 | 1 | 0 | 2 | 0 | 0 | 2 | 1 | 2 | 1 | 637.00 |
| 689516 | 3 | 1 | 1 | 1 | 1 | 2 | 0 | 2 | 0 | 1 | 2 | 2 | 1 | 1 | 4 | 2 | 1 | 2 | 3 | 2 | 2 | 2 | 2 | 1 | 1 | 2 | 1 | 0 | 2 | 0 | 0 | 3 | 0 | 1 | 1 | 603.04 |
| 1473162 | 11 | 0 | 0 | 3 | 1 | 1 | 8 | 1 | 0 | 0 | 2 | 2 | 4 | 4 | 3 | 2 | 1 | 3 | 3 | 2 | 1 | 2 | 2 | 2 | 1 | 2 | 1 | 0 | 3 | 1 | 0 | 3 | 1 | 2 | 1 | 540.66 |
| TP_FAIXA_ETARIA | TP_SEXO | TP_ESTADO_CIVIL | TP_COR_RACA | TP_NACIONALIDADE | TP_ST_CONCLUSAO | TP_ANO_CONCLUIU | TP_ESCOLA | IN_TREINEIRO | TP_LINGUA | Q001 | Q002 | Q003 | Q004 | Q005 | Q006 | Q007 | Q008 | Q009 | Q010 | Q011 | Q012 | Q013 | Q014 | Q015 | Q016 | Q017 | Q018 | Q019 | Q020 | Q021 | Q022 | Q023 | Q024 | Q025 | MEDIAS | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2019690 | 4 | 0 | 1 | 1 | 1 | 1 | 2 | 1 | 0 | 0 | 3 | 3 | 4 | 4 | 4 | 4 | 1 | 3 | 3 | 2 | 1 | 2 | 2 | 2 | 2 | 2 | 1 | 1 | 3 | 0 | 0 | 3 | 0 | 3 | 1 | 615.14 |
| 1420006 | 5 | 0 | 1 | 1 | 1 | 1 | 2 | 1 | 0 | 0 | 3 | 2 | 4 | 4 | 3 | 2 | 1 | 3 | 3 | 2 | 1 | 2 | 1 | 2 | 1 | 2 | 1 | 0 | 2 | 0 | 1 | 3 | 0 | 2 | 1 | 696.72 |
| 1746300 | 9 | 1 | 1 | 3 | 1 | 1 | 7 | 1 | 0 | 1 | 1 | 2 | 2 | 1 | 3 | 2 | 1 | 2 | 3 | 2 | 2 | 2 | 2 | 2 | 1 | 1 | 1 | 0 | 2 | 1 | 0 | 3 | 0 | 1 | 1 | 609.58 |
| 727365 | 6 | 1 | 1 | 3 | 1 | 1 | 3 | 1 | 0 | 0 | 1 | 2 | 1 | 1 | 3 | 2 | 1 | 2 | 3 | 1 | 2 | 2 | 1 | 2 | 1 | 1 | 1 | 0 | 2 | 0 | 0 | 3 | 0 | 2 | 1 | 542.56 |
| 1278903 | 3 | 1 | 1 | 3 | 1 | 1 | 1 | 1 | 0 | 0 | 3 | 3 | 4 | 4 | 3 | 2 | 1 | 3 | 3 | 1 | 2 | 2 | 1 | 1 | 1 | 2 | 1 | 0 | 2 | 0 | 0 | 3 | 0 | 1 | 1 | 539.34 |
| 1062964 | 10 | 0 | 1 | 3 | 1 | 1 | 6 | 1 | 0 | 1 | 1 | 2 | 3 | 6 | 4 | 2 | 1 | 2 | 3 | 1 | 1 | 2 | 1 | 2 | 1 | 1 | 1 | 0 | 2 | 0 | 0 | 3 | 1 | 3 | 1 | 521.12 |
| 1091290 | 5 | 0 | 1 | 3 | 1 | 1 | 2 | 1 | 0 | 1 | 2 | 2 | 3 | 1 | 2 | 2 | 1 | 3 | 3 | 1 | 2 | 2 | 1 | 2 | 1 | 2 | 1 | 0 | 1 | 0 | 0 | 3 | 0 | 1 | 1 | 456.28 |
| 2435736 | 3 | 1 | 1 | 1 | 1 | 2 | 0 | 2 | 0 | 1 | 2 | 2 | 2 | 2 | 1 | 2 | 1 | 2 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 2 | 1 | 0 | 1 | 0 | 1 | 2 | 0 | 1 | 1 | 418.74 |
| 2254884 | 4 | 0 | 1 | 3 | 1 | 1 | 0 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 3 | 2 | 1 | 2 | 3 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 0 | 2 | 0 | 0 | 2 | 0 | 1 | 1 | 562.22 |
| 776691 | 7 | 1 | 1 | 2 | 1 | 1 | 4 | 1 | 0 | 1 | 1 | 2 | 1 | 1 | 4 | 2 | 1 | 2 | 3 | 1 | 2 | 2 | 1 | 1 | 1 | 1 | 1 | 0 | 2 | 0 | 0 | 2 | 0 | 1 | 0 | 465.26 |
Most frequently occurring
| TP_FAIXA_ETARIA | TP_SEXO | TP_ESTADO_CIVIL | TP_COR_RACA | TP_NACIONALIDADE | TP_ST_CONCLUSAO | TP_ANO_CONCLUIU | TP_ESCOLA | IN_TREINEIRO | TP_LINGUA | Q001 | Q002 | Q003 | Q004 | Q005 | Q006 | Q007 | Q008 | Q009 | Q010 | Q011 | Q012 | Q013 | Q014 | Q015 | Q016 | Q017 | Q018 | Q019 | Q020 | Q021 | Q022 | Q023 | Q024 | Q025 | MEDIAS | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2 | 0 | 1 | 1 | 1 | 2 | 0 | 3 | 0 | 0 | 3 | 3 | 2 | 2 | 4 | 3 | 1 | 3 | 3 | 2 | 1 | 2 | 2 | 2 | 1 | 2 | 1 | 1 | 3 | 0 | 1 | 3 | 1 | 2 | 1 | 563.88 | 2 |
| 1 | 2 | 0 | 1 | 3 | 1 | 2 | 0 | 2 | 0 | 0 | 2 | 2 | 3 | 2 | 4 | 2 | 1 | 2 | 3 | 1 | 2 | 2 | 1 | 1 | 1 | 1 | 1 | 0 | 2 | 0 | 0 | 3 | 0 | 1 | 1 | 505.26 | 2 |
| 2 | 2 | 0 | 1 | 3 | 1 | 2 | 0 | 2 | 0 | 1 | 1 | 1 | 1 | 1 | 4 | 1 | 1 | 2 | 3 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 0 | 2 | 0 | 0 | 3 | 0 | 1 | 1 | 490.26 | 2 |
| 3 | 2 | 0 | 1 | 3 | 1 | 2 | 0 | 2 | 0 | 1 | 2 | 2 | 1 | 1 | 3 | 1 | 1 | 2 | 3 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 0 | 2 | 0 | 0 | 2 | 0 | 1 | 0 | 459.50 | 2 |
| 4 | 2 | 0 | 1 | 3 | 1 | 2 | 0 | 2 | 0 | 1 | 3 | 2 | 3 | 2 | 4 | 2 | 1 | 2 | 3 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | 1 | 0 | 2 | 0 | 0 | 3 | 0 | 1 | 1 | 529.14 | 2 |